Name | Version | Summary | Date |
rl-replicas | 0.0.7 | Reinforcement Learning Replications is a set of PyTorch implementations of reinforcement learning algorithms. | 2024-12-15 03:42:54 |
assume-framework | 0.5.0 | ASSUME - Agent-Based Electricity Markets Simulation Toolbox | 2024-12-10 07:54:28 |
pufferlib | 2.0.3 | PufferAI Library - PufferAI's library of RL tools and utilities | 2024-12-09 01:09:54 |
gymnasium-2048 | 0.0.2 | A reinforcement learning environment for the 2048 game based on Gymnasium | 2024-12-01 21:31:57 |
commonpower | 0.4.0 | A package for the exploration of safe single/multi-agent reinforcement learning in smart grids. | 2024-11-26 12:22:42 |
SAC-pytorch | 0.0.9 | Soft Actor Critic - PyTorch | 2024-11-19 22:09:42 |
jaxsim | 0.5.0 | A differentiable physics engine and multibody dynamics library for control and robot learning. | 2024-11-15 17:47:57 |
streaming-deep-rl | 0.0.2 | Streaming Deep Reinforcement Learning | 2024-11-15 15:34:42 |
nemo-aligner | 0.5.0 | NeMo-Aligner - a toolkit for model alignment | 2024-11-14 23:55:58 |
ev2gym | 1.1.0 | A realistic V2G simulator environment | 2024-11-13 15:55:34 |
xcs-rc | 1.2.4 | Accuracy-based Learning Classifier Systems with Rule Combining | 2024-11-06 08:30:26 |
deep-hedging | 2.0.2 | Hedging Derivatives Under Incomplete Markets with Deep Learning | 2024-10-29 22:19:35 |
ReplicantDriveSim | 0.4.8 | A Unity Traffic Simulation | 2024-10-23 16:38:24 |
deeptrade-mbrl | 0.1.1 | A simple trading system for backtesting Model Based RL strategies | 2024-10-19 16:35:28 |
active-pynference | 0.1.8 | A Python implementation of an Active Inference engine using Sophisticated Inference. | 2024-10-14 07:37:56 |
rldec | 0.5.2 | RLDec is a decomposition tool that analyzes the source code of a monolithic Java application and suggests the recommended microservices for each class in the system using a Deep Reinforcement Learning based method. | 2024-10-11 19:40:19 |
xminigrid | 0.9.1 | JAX-accelerated meta-reinforcement learning environments inspired by XLand and MiniGrid | 2024-10-11 05:28:05 |
kheperax | 0.2.0 | A-maze-ing environment in JAX | 2024-09-25 20:11:51 |
contextual-bandits-algos | 0.1.0 | A library for contextual multi-armed bandit algorithms. | 2024-09-15 22:36:26 |
memorial | 0.0.8 | Replay Buffer Implementations for RL | 2024-09-15 05:37:19 |